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The visibility algorithm has been recently introduced as a mapping between time series and com- 
plex networks. This procedure allows to apply methods of complex network theory for characterizing 
' time series. In this work we present the horizontal visibility algorithm, a geometrically simpler and 

(N : analytically solvable version of our former algorithm, focusing on the mapping of random series (se- 

^ ' ries of independent identically distributed random variables). After presenting some properties of 

I the algorithm, we present exact results on the topological properties of graphs associated to random 

. series, namely the degree distribution, clustering coefficient, and mean path length. We show that 

the horizontal visibility algorithm stands as a simple method to discriminate randomness in time 
series, since any random series maps to a graph with an exponential degree distribution of the shape 
P{k) = (l/3)(2/3)''~^, independently of the probability distribution from which the series was gen- 
erated. Accordingly, visibility graphs with other P{k) are related to non-random series. Numerical 
' simulations confirm the accuracy of the theorems for finite series. In a second part, we show that 

, the method is able to distinguish chaotic series from i.i.d. theory, studying the following situations: 

(i) noise-free low-dimensional chaotic series, (ii) low- dimensional noisy chaotic series, even in the 
presence of large amounts of noise, and (iii) high-dimensional chaotic series (coupled map lattice), 
without needs for additional techniques such as surrogate data or noise reduction methods. Finally, 
' O , heuristic arguments are given to explain the topological properties of chaotic series and several 

C/3 ' sequences which are conjectured to be random are analyzed. 

o 

, PACS numbers: 05.45.Tp, 89. 75. He, 05.45.-a 

Ph. 

I. INTRODUCTION 

' Recently, the visibility algorithm, a new tool for time series analysis, has been introduced |l|. The method, inspired 
fS| in the concept of visibility [2| , proceeds by mapping time series into graphs according to a specific geometric criterion, 
in order to make use of complex networks techniques \3tW\ for characterize time series (some other works based on 

, a similar philosophy can be found in [7, 8]). In short, a visibility graph is obtained from the mapping of a time 
ps) ■ series into a network according to the following visibility criterion: Two arbitrary data {ta,ya) and (t^, yi,) in the time 

' series have visibility, and consequently become two nodes in the associated graph, if any other data {tc,yc) such that 
O . ta <tc <tb fulfills 

yc < ya + {yb - ya)*-^ — ^- (i) 

: ^ ■ tb — ta 

XI 

It has been shown 1] that time series structure is inherited in the associated graph, such that periodic, random and 



(N 



fractal series map into motif- like, random exponential and scale-free networks [9Hll|. respectively. These findings 
suggest that the visibility graph may capture the dynamical fingerprints of the process that generated the series. 
Furthermore, it has been recently pointed out that this algorithm stands as a method for estimating the Hurst 
exponent H in fractional Brownian series [l2l] , since a linear relation between H and the exponent 7 of the power law 
degree distribution in the scale free associated visibility graph exists 13]. While being relatively new, some applications 
of the method to analyze time series, in different contexts from fiuid dynamics 14] or atmospheric sciences [l5j | to 
finance [l3], have been presented so far. 
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FIG. 1: Illustrative example of the visibility algorithm. In the upper part we plot a periodic time series and in the bottom 
part we represent the graph generated through the visibility algorithm. Each datum in the series corresponds to a node in the 
graph, such that two nodes are connected if their corresponding data heights fulfill the visibility criterion of equation [1] Note 
that the degree distribution of the visibility graph is composed by a finite number of peaks, much in the vein of the Discrete 
Fourier Transform of a periodic signal. We can thus interpret the visibility algorithm as a geometric transform. 

What does the visibility algorithm stand for? In order to deepen on the geometric interpretation of the visibility 
graph, let us focus on a periodic series. It is straightforward that its visibility graph is a concatenation of a motif: a 
repetition of a pattern (see figure [J). Now, which is the degree distribution P{k) of this visibility graph? Since the 
graph is just a motif's repetition, the degree distribution will be formed by a finite number of non-null values, this 
number being related to the period of the associated periodic series. This behavior reminds us the Discrete Fourier 
Transform (DFT), which for periodic series is formed by a finite number of peaks (vibration modes) related to the 
series period. Using this analogy, we can understand the visibility algorithm as a geometric (rather than integral) 
transform. Whereas a DFT decomposes a signal in a sum of (eventually infinite) modes, the visibility algorithm 
decomposes a signal in a concatenation of graph's motifs, and the degree distribution simply makes a histogram of 
such 'geometric modes'. While the time series is defined in the time domain and the DFT is defined on the frequency 
domain, the visibility graph is defined on the 'visibility domain'. This is, of course, a hand-waving analogy and 
further work should study its extent rigorously. For instance, this transform is not, as presented, a reversible one. 
Reversibility can however be easily obtained weighting the links in the visibility graph with the slope of the visibility 
line that links the associated data heights. The weighted version of the algorithm and its geometric transform nature 
will be addressed elsewhere. At this point we can comment that whereas a generic DFT fails to capture the presence 
of nonlinear correlations in time series (such as the presence of chaotic behavior), in the second part of this paper 
we will show that the visibility algorithm can clearly distinguish between white noise (i.e. a sequence of identically 
independent random variables) and chaotic series. 

Of course the latter analogy is, so far, a simple metaphor to help our intuition. Indeed, while some analytical results 
have already been put forward within the visibility algorithm [l|, [l3j (typically making use of concepts borrowed from 
extreme value theory), no rigorous theory for the visibility algorithms exists so far. Our goal in this work is to make 
the first steps in that direction, providing results on the properties of the visibility graphs associated to random series. 
In order to derive exact results, we present here a slight modification of the algorithm that we call the horizontal 
visibility algorithm, which is essentially similar to the former yet having a geometrically simpler visibility criterion. 
According to this latter criterion, the generated horizontal visibility graph stands as a subgraph of the visibility graph 
We will prove that, surprisingly, the horizontal visibility graph associated to any random series is a Small- World 'lO[ 
random graph with a universal exponential degree distribution of the form P{k) — (1/3) • (2/3)'^^^, independently of 
the probability distribution f{x) from which the series was generated. Accordingly, the horizontal visibility algorithm 
stands as an extremely simple test for randomness, that for instance, can easily distinguish random series from chaotic 
ones. The remaining of this paper goes as follows: in section II we introduce the horizontal visibility algorithm, a 
geometrically simpler version of the visibility algorithm that allows analytical tractability, along with some of its 
properties. In section III we derive exact results for the degree distribution P{k) of the associated graph to a generic 
random time series. In section IV exact results on other properties of horizontal visibility graphs are also derived, 
concretely (i) P{k\x), the probability that a node associated to a datum of height x has degree k, (ii) the clustering 
distribution P{C), (iii) the probability of long distance visibility Pin) and (iv) an estimation of its mean path length 
L{N) (sections III and IV contain technical proofs that the non- interested reader can eventually skip). In section V 
we study the reliability of the algorithm to discriminate chaotic series from our theory. For this task we calculate the 
degree distribution of visibility graphs associated to (i) low-dimensional chaotic series (logistic map, Henon map), (ii) 
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noisy low-dimensional chaotic series (with amounts of noise of 100% by amplitude), and (iii) high-dimensional chaotic 
series (coupled map lattice). In every case, the algorithm easily distinguishes those series from a series of i.i.d. random 
variables (white noise). At this point we also conjecture that the topological properties of graphs associated to chaotic 
series are related to the statistics of Poincare recurrence times In section VI we make use of the method as a 
randomness test, and study some number theoretical sequences that are conjectured to be normal (decimal expansion 
of normal numbers [18]). We finally provide some concluding remarks in section VII. 



II. HORIZONTAL VISIBILITY ALGORITHM 
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FIG. 2: Illustrative example of the horizontal visibility algorithm. In the upper part we plot a time series and in the bottom 
part we represent the graph generated through the horizontal visibility algorithm. Each datum in the series corresponds to a 
node in the graph, such that two nodes are connected if their corresponding data heights are larger than all the data heights 
between them. The data values (heights) are made explicit in the top. 

The horizontal visibility algorithm maps time series into graphs and it is defined as follows. Let {xjji^i.jv be a time 
series of N data. The algorithm assigns each datum of the series to a node in the horizontal visibility graph (graph 
from now on). Two nodes i and j in the graph are connected if one can draw a horizontal line in the time series joining 
Xi and Xj that does not intersect any intermediate data height (see figure [2] for a graphical illustration). Hence, i and 
j are two connected nodes if the following geometrical criterion is fulfilled within the time series: 

Xi, Xj > Xn for all n such that i < n < j . (2) 

This algorithm is a simplification of the so called visibility algorithm [T| that has been recently introduced. As a 

matter of fact, notice that given a time series, its horizontal visibility graph is always a subgraph of its associated 

visibility graph. Accordingly, as in the former case, the horizontal visibility graph associated to a time series is always: 

(«) Connected: each node sees at least its nearest neighbors (left-hand side and right-hand side). 

(m) Invariant under affine transformations of the series data: the visibility criterium is invariant under rescaling of 

both horizontal and vertical axis, as well as under horizontal and vertical translations. 

Some other properties can be stated, namely: 

(iii) Reversible/Irreversible character of the mapping: note that some information regarding the time series is in- 
evitably lost in the mapping from the fact that the network structure is completely determined in the (binary) adja- 
cency matrix. For instance, two periodic series with the same period as Tl = 3, 1, 3, 1, ... and T2 = 3, 2, 3, 2, ... 
would have the same visibility graph, albeit being quantitatively different. Although the spirit of the visibility graph 
is to focus on time series structural properties (periodicity, fractality, etc.), the method can be trivially generalized 
by making use of weighted networks (where the adjacency matrix is not binary and the weights determine the height 
difference of the associated data), if we eventually need to quantitatively distinguish time series like Tl and T2, for 
instance. Using weighted networks, the algorithm trivially converts to a reversible one. 

(iv) Undirected/directed character of the mapping: Although this algorithm generates undirected graphs, note that 
one could also extract a directed graph (related to the temporal axis direction) in such a way that for a given node 
one should distinguish two different degrees: an ingoing degree fci„, related to how many nodes see a given node i. 
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and an outgoing degree fcout, that is the number nodes that node i sees. In that situation, if the direct visibihty 
graph extracted from a given time series is not invariant under time reversion (that is, if P(fci„) 7^ P{kout))^ one could 
assert that the process that generated the series is not conservative. In a first approximation we have studied the 
undirected version, and the directed one will be eventually addressed in further work. While the undirected choice 
seems to violate causality, note that the same 'causality violation' is likely to take place when performing the DFT of 
a time series, for instance. 

{vi) Comparison between geometric criteria: Note that the geometric criterion defined for the horizontal visibility 
algorithm is more 'visibility restrictive' than its analogous for the general case. That is to say, the nodes within the 
horizontal visibility graph will have 'less visibility' than their counterparts within the visibility graph. While this fact 
does not have an impact on the qualitative features of the graphs, quantitatively speaking, horizontal visibility graphs 
will have typically 'less statistics'. For instance, it has been shown that the degree distribution P{k) of the visibility 
graph associated to a fractal series is a power law P{k) ~ , such that the Hurst exponent H of the scries is linearly 
related to 7 [isj . Now, for practical purposes it is more recommendable to make use of the visibility algorithm (in 
detriment of the horizontal version) when measuring the Hurst exponent of a fractal series, since a good estimation 
of 7 requires at least two decades of statistics in P(fc), something which is more likely within the visibility algorithm. 
In what follows we will show that the simplicity of the horizontal version of the algorithm -which is computationally 
faster than the original- allows analytical tractability, and nonetheless, this latter method is well fitted to distinguish 
different degrees of chaos from a sequence of uncorrelated random variables. 



III. DEGREE DISTRIBUTION OF THE VISIBILITY GRAPH ASSOCIATED TO A RANDOM TIME 

SERIES 

Consider a bi-infinite time series created from a random variable X with probability distribution f{x) with x € [0, 1] 
and let us construct its associated horizontal visibility graph (note that if the distribution's support is a generic interval 
X G [a, 6], we can rescale to [0, 1] without loss of generality since the associated graph remains invariant, and this also 
applies to unbounded supports). For convenience, we will label a generic datum xq as the 'seed' datum from now on. 
In order to derive the degree distribution P{k) Q of the associated graph, we need to calculate the probability that 
an arbitrary datum with value xq has visibility of exactly k other data. If xq has visibility of k data, there always 
will exist two 'bounding data', one on the right-hand side of xq and another one on its left-hand side, such that the 
fc — 2 remaining visible data will be located inside that window (in fact, fc = 2 is the minimum possible degree). As 
these 'inner' data should appear sorted by size, there are exactly k — 1 different possible configurations {Ci}i=o..fc-2, 
where the index i determines the number of inner data on the left-hand side of xq (see figure [3] for an illustration of 
the possible configurations and a labelling recipe of the data in the case fc = 4). Accordingly, Ci corresponds to the 
configuration for which i inner data are placed at the left-hand side of xq, and k — 2 — i inner data are placed at its 
right-hand side. Each of these possible configurations have an associated probability pi = p{Ci) that will contribute 
to P{k) such that 

fe-2 

P{k) = Y.P,. (3) 

Before trying to find a general relation for P{k) and for illustrative purposes, let us study some particular cases. The 
first and simplest case is P{k — 2), that is, the probability that the seed data has two and only two visible data, the 
minimum degree. These obviously will be the bounding data, that we will label x_i and xi for left-hand side and 
right-hand side of the seed respectively. The probability that a;o sees fc > 2 is 1 by construction, since the horizontal 
visibility algorithm assures that any data will always have visibility of its first neighbors. Now, in order to assure that 
fc = 2, we have to impose that the bounding data neighbors have a larger height than the seed, that is, x_i > xq and 
xi> Xq. Then, 

P(fc = 2) =Prob(a;_i,xi >0) = / f{xo)dxo f f{xi)dxi f fix^i)dx^i. (4) 

Now, the cumulative probability distribution function F{x) of any probability distribution /(x) is defined as 

F{x) = r f{x')dx', (5) 
Jo 

where dF{x)/dx — f{x), F{0) = and F{1) = 1. In particular, the following relation between / and F holds: 



(6) 



FIG. 3: Set of possible configurations for a seed data xq with fc = 4. Observe that the sign of the subindex in Xi indicates if the 
data is located whether at left-hand side of xq (sign minus) or at right-hand side. Accordingly, the bounding's data subindex 
directly indicates the amount of data located in that side. For instance, Co is the configuration where none of the fc — 2 = 2 
inner data are located in the left-hand side of xq, and therefore the left bounding data is labeled as X-i and the right bounding 
data is labeled as x^. Ci is the configuration for which an inner data is located in the left-hand side of xq and another inner 
data is located in its right-hand side. Finally, C2 is the configuration for which both inner data are located in the left-hand 
side of the seed. Notice that an arbitrary number of hidden data can be eventually located among the inner data, what is 
schematically represented in the figure as a row of vertical lines. 

We can accordingly rewrite and compute equation 2] as 

P{k = 2)= f /(xo)[l - F{xo)]'dxo - i (7) 
Jo •J 

independently of the shape of the probability distribution f{x). 



Let us proceed by tackling the case P{k — 3), that is, the probability that the seed has three and only three visible 
data. Two different configurations arise: C'o, in which xq has 2 bounding visible data (a;_i and X2 respectively) and 
a right-hand side inner data (a;i), and the same for Ci but with the inner data being place at the left-hand side of 
the seed, so 

Pik = 3) = p{Co) + p{Ci) = po + Pi. 

Notice at this point that an arbitrary number r of hidden data ni,n2...nr can eventually be located between the 
inner data and the bounding data, and this fact needs to be taken into account in the probability calculation. The 
geometrical restrictions for the rij hidden data are Uj < xi, j = for Co and rrij < a;_i, j = l,...,s for Ci. 

Then, 

PO = Pmh(^{x^i,X2 > Xq) n {Xi < Xq) n {{Uj < Xi}.j^i,,,,,r)j , 

Pi = Prohi{x^2,xi>XQ)r]{x^i<xo)r]{{mj<x^i}j^i^„„s)). (8) 



Now, we need to consider every possible hidden data configuration (Cq without hidden data, Co with a single hidden 
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data, Co with two hidden data, and so on, and the same for Ci). With a httle calculus we come to 
Po= f{xo)dxo / f{x-i)dx^i / f{x2)dx2 / f{xi)dxi + 

JQ Jxo Jxo Jt) 

X! / fMdxo / f{x^i)dx-i / f{x2)dx2 / /(a;i)da;i J| / f{nj)dnj 

r=iJO Jxo Jxo Jo i=l"^° 

where the first term corresponds the contribution of a configuration with no hidden data and the second sums up the 
contributions of r hidden data. Making use of the properties of the cumulative distribution F(x) we arrive to 

Po = / f{xo)dxo / f{x^i)dx^i / f{x2)dx2 / —, — rdxi, (9) 

Jo Jxo Jxo Jo l-F{Xi) 

where we also have made use of the sum of a geometric series. We can find an identical result for pi, since the last 
integral on equation [S] only depends on xq and consequently the configuration provided by Ci is symmetrical to the 
one provided by Cq. We finally have 

P{k = 3) = 2po = -2 ^ /(^o)(l - F{xo)f ln(l - F{xo))dxo = | (10) 

where the last calculation also involves the change of variable z = 1 — F{x). Again, the result is independent of f{x). 

Hitherto, we can deduce that a given configuration Ci contributes to P{k) with a product of integrals according to 
the following rules: 

• The seed data provides a contribution of f{xo)dxo (S). 

• Each boundary data provides a contribution of J^^ f{x)dx (B). 

• An inner data provides a contribution (^p'^^-^ (I). 

These diagrammatic-like rules allow us to schematize in a formal way the probability associated to each configuration. 
For instance in the case k = 2, P{k) has a single contribution pq represented by the formal diagram B-S-B, while for 
k — 3, P{k) — pi^ + pi where po's diagram is B-S-TB and piS is B-I-S-B. It seems quite straightforward to derive a 
general expression for P{k), just by applying the preceding rules for the contribution of each Ci. However, there is 
still a subtle point to address that will become evident for the case P{k = 4) = po +Pi +P2 (see figure While in 
this case Ci leads to essentially the same expression as for both configurations in fc = 3 (and in this sense one only 
needs to apply the preceding rules to derive pi), Cq and C2 are geometrically different configurations. These latter 
ones are configurations formed by a seed, two bounding and two concatenated inner data, and concatenated data lead 
to concatenated integrals. For instance, applying the same formalism as for fc = 3, one come to the conclusion that 
for fc = 4, 

[\i ^w r n^i)dxi r° f{x2)dx2 r\, f\, ,,,, 

PO^ / f{xo)dxo / ^--^ / / /(xg / f{x_i)dx_i. 11) 

Jo Jo ^-F{xi)J^^ l-F[x2)J^^ ^„ 

While for the case fc = 3 every integral only depended on xq (and consequently we could integrate independently 
every term until reaching the dependence on Sq), having two concatenated inner data on this configuration generates 
a dependence on the integrals and hence on the probabilities. For this reason, each configuration is not equiprobable 
in the general case, and thus will not provide the same contribution to the probability P{k) [k — 3 was an exception 
for symmetry reasons). In order to weight appropriately the effect of these concatenated contributions, we can make 
use of the definition olpi. Since P{k) is formed by /c — 1 contributions labelled Co, Ci...Cfe_2 where the index denotes 
the number of inner data present at the left-hand side of the seed, we deduce that in general the /c — 2 inner data 
have the following effective contribution to P{k): 

• po has k — 2 concatenated integrals (right-hand side of the seed). 

• pi has fc — 3 concatenated integrals (right-hand side of the seed) and an independent inner data contribution 
(left-hand side of the seed). 
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• p2 has fc — 4 concatenated integrals (right-hand side of the seed) and another 2 concatenated integrals (left-hand 
side of the seed). 



• Pk-2 has fc — 2 concatenated integrals (left-hand side of the seed). 
Observe that pi is symmetric with respect to the seed. 

Including this modification in the diagrammatic rules, we are now ready to calculate a general expression for P{k). 
Formally, 

k — 2 

Pik)^j2is][miw]k-2-„ (12) 

where the sum extends to each of the fc — 1 configurations, the superindex denotes exponentiation and the subindex 
denotes concatenation (this latter expression can be straightforwardly proved by induction). In order to solve it, one 
needs to firstly calculate the concatenation of n inner data integrals [/]„ = /(n), that is 

f{xi)dxi j-j Z"^" f{x.j+i)dxj 



^ > Jo l-F{x,) IW,, l~Fix,+,) ^ ' 



The calculation of I{n) is easy but quite tedious. One proceeds to integrate equation [T2] step by step (first n 
then n = 2, and so on), and a recurrence quickly becomes evident. One can easily prove by induction that 



ln(l-F(a;o)) 



(14) 



n 

According to the formal solution [T^] and to equation [T31 we finally have 

^(^) = E -UkX^'-V f fM[l-Fixo)n\nil-Fixo))]'-'dxo 

j—Q ' ^ J ) ' <J 

^1,,^^ (fc-2)! 1/2^-^ 
= ' g,!(fc-2-,)! = 3Uj ^''^ 

Surprisingly, we can conclude that for every probability distribution f{x), the degree distribution P{k) of the associated 
horizontal visibility graph has the same exponential form. 

In order to check further the accuracy of our analytical results for the case of finite time series, we have performed 
several numerical simulations. We have generated random series of 10^ data from different distributions f{x) and 
have generated their associated horizontal visibility graphs. In figure 2] we have plotted the degree distribution of 
the resulting graphs (triangles correspond to a series extracted from a uniform distribution, while circles and squares 
correspond to one extracted from a Gaussian and a power law distribution f{x) ^ x^^ respectively). The solid line 
corresponds to the theoretical equation 1151 showing a perfect agreement with the numerics. 



IV. SOME OTHER TOPOLOGICAL PROPERTIES OF THE VISIBILITY GRAPH 



A. Degree versus height 

An interesting aspect worth exploring is the relation between data height and the node degree, that is, to study 
whether a functional relation between the height of a datum and the degree of its associated node holds. In this 
sense, let us define P{k\x) as the conditional probability that a given node has degree fc provided that it has height 
X. Observe that P{k\x) can be easily deduced from eq. [151 such that 
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FIG. 4: Semi-log plot of the degree distribution of an horizontal visibility graph associated to random series of 10® data 
extracted from a uniform distribution f{x) = U[0, 1] (triangles), a Gaussian distribution (circles), and power law distribution 
f{x) ~ (squares). Solid line corresponds to equation 14. 
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FIG. 5: Average degree of a node, as a function of the associated datum's height: (circles) numeric results from a random 
series of 10® data extracted from a uniform distribution f{x) — U[0, 1]. The solid line corresponds to the theoretical prediction 
ea ll7l showing a perfect agreement. It comes evident that the hubs stand for the nodes associated to the data with larger 
values (extreme events). 

Notice that probabilities are weW normalized and that X]feL2^(^l^) ~ independently of x. Now, we can define an 
average value of the degree of a node associated to a datum of height x, K{x), in the following way 



oo 

K{x) = ^fcP(fc|x) = 2 - 21n(l - F{x)). 

k-2 



(17) 
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Since F{x) G [0, 1] and ln(a;) are monotonically increasing functions, K{x) will also be monotonically increasing. We 
can thus conclude that graph hubs (that is, the most connected nodes) are the data with largest values, that is, the 
extreme events of the series. 

In order to check the accuracy of the theoretical prediction within finite series, in fig. [5] we have plotted (circles) the 
numerical values of K{x) within a random series of 10^ data extracted from a uniform distribution with F[x) = x. 
The line corresponds to eq. 1171 showing a perfect agreement. 



B. Local clustering coefflcient distribution 

The local clustering coefficient C 0-0, 0] of an horizontal visibility graph associated to a random series can be easy 
deduced by means of geometrical arguments. For a given node i, C denotes the rate of nodes connected to i that are 
connected between each other (observe that in this section, C denotes clustering: do not mistake this with the 'C 
(configuration) of section III). In other words, we have to calculate from a given node i how many nodes from those 
visible to i have mutual visibility (triangles), normalized with the set of possible triangles (2). In a first step, if a 
generic node i has degree fc = 2, these nodes are straightforwardly two bounding data, hence having mutual visibility. 
Thus, in this situation there exists 1 triangle and C(fc = 2) = 1. Now if a generic node i has degree fc = 3, one of 
its neighbors will be an inner data, which will only have visibility of one of the bounding data (by construction). 
We conclude that in this situation we can only form 2 triangles out of 6 possible ones, thereby C(fc = 3) = 2/6. In 
general, for a degree k we can form fc — 1 triangles out of (2) possibilities, and then: 

C(k) = t_i = I (18) 

what indicates a so called hierarchical structure [20] ■ This relation between k and C allows us to deduce the local 
clustering coefficient distribution P{C): 

m^KlX ' = p{vc) 

2/C-2 

To check the validity of this latter relation within finite series, in figure [6] we depict the clustering distribution of 
an horizontal visibility graph associated to a random series of 10^ data (dots) obtained numerically. The solid line 
corresponds to the theoretical prediction (equation [T51) . in excellent agreement with the numerics. 
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C. Long distance visibility, mean degree and mean path length 

In order to derive the scaling of the mean path length Q, let us first calculate the probability P{n) that two data 
separated by n intermediate data be two connected nodes in the graph. Consider again a time series extracted from 
a random variable X with probability distribution f{x) and x G [0, 1], and let us construct its associated horizontal 
visibility graph. An arbitrary value xq from this series will 'see' Xn (and consequently will be connected to node Xn 
in the graph) iff Xi < min(a::o, Xn) for all Xi, i = 1,2, n — I. Then P{n) can be expressed as: 

P{n) ^ / / f{xo)f{xn)dxodxn 

Jo Jo JO 

min(a;o,a;Ti) 

f{xi)...f{Xn^l)dXi...dXn-l (20) 

Since the integration limits are independent, rewriting x = min(a:;o, Xn) we have 

Pin)= f f f{xo)f{xn)F''-^(x)dxodxn. (21) 
Jo Jo 
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FIG. 6: Semi-log plot of the clustering distribution of an horizontal visibility graph associated to random series of 10 data 
extracted from a uniform distribution f{x) = U[0, 1] (dots). The solid line corresponds to the theoretical prediction P{C) = 
{l/3)(2/3)^^^~^ . In order to avoid border effects we have imposed periodic boundary conditions in the data series. 



We can fix xo and move Xn without loss of generality, such that the latter equation can be expressed as 



1 rxo ^1 



P{n)^ / f{xo)f{x,,)F^-\x,,)dxodXr,+ / /(a;o)/(a;„)F"-i(xo)dxodx„ (22) 

Jo Jo Jo J Xo 



tho minimum hero is x„ the minimum here is xq 

Applying the definition of F{x) and the relation |6l with a little calculus we get 

P{n) = (^-l)^ f{xn)F^{xo)dxo + 1^ f{xo)F"^^(xo)dxo 
2 



n{n + 1) 



(23) 



Observe that P{n) is again independent of the probability distribution of the random variable X. Notice that the 
latter result can also be obtained, alternatively, with a purely combinatorial argument that reads as it follows. Take 
a random series with n+1 data and choose its two largest values. This latter pair can be placed with equiprobability 
in n{n + 1) positions, while only two of them are such that the largest values are placed at distance n, so we get 
-^("') ^ n{n+i) '^^ agreement with the previous development. 

At this point, we can calculate the mean degree < fc > of the horizontal visibility graph: 

<k>=J2kP{k) = Y^^i^\ =4, (24) 

that we can recover from P(n) as: 

oo 

< fc 2^P(n) = 4. (25) 

n=l 

Now, for illustration purposes, in figure[7]we show the adjacency matrix of the horizontal visibility graph associated 
to a random series of lO'^ data (the entry j, j is filled in black if nodes i and j are connected, and left blank otherwise). 
Since every data Xi has visibility of its first neighbors Xi^i, x^+i, every node i will be connected by construction 
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FIG. 7: Adjacency matrix of an horizontal visibility graph associated to a random series of 10^ data. 

to nodes i — 1 and i + 1: the graph is thus connected. Observe in figure [7] that the graph evidences a typical 
homogeneous structure: the adjacency matrix is predominantly filled around the main diagonal. Furthermore, the 
matrix evidences a superposed sparse structure, reminiscent of the visibility probability P{n) = 2/{n{n + 1)) that 
introduces some shortcuts in the horizontal visibility graph, much in the vein of the Small- World model [10]. Here the 
probability of having these shortcuts is given by P{n). Statistically speaking, we can interpret the graph's structure 
as quasi-homogeneous, where the size of the local neighborhood increases with the graph's size. Accordingly, we can 
approximate its mean path length L{N) as: 

AT-l N-l 2 

L{N) « = E = 2 log(^) + 2(7 - 1) + 0{l/N), (26) 

^ — ' ^ — ' n + 1 

n— 1 n— 1 

where we have made use of the asymptotic expansion of the harmonic numbers and 7 is the Euler-Mascheroni constant. 
As can be seen, the scaling is logarithmic, denoting that the horizontal visibility graph associated to a generic random 
series is Small- World [l3| , according to what figure 6 suggested. In figure [8] we have plotted the numerical results of 
L{N) (dots) of an horizontal visibility graph associated to several random series of increasing size N — 2^, 2^, . . . 2^^. 
The solid line corresponds to the best fit L{N) = 1.31og(A^) — 1.7. 

V. APPLICATION OF THE THEORY TO DISCRIMINATE CHAOTIC SERIES 

So far we have presented exact results on the topological properties of graphs associated to series of i.i.d. random 
variables (random series from now on) via the horizontal visibility algorithm. The very first application of this 
theory can be found in the task of discriminating a random signal from a chaotic one. The task of identifying 
random processes and more concretely discriminating (low dimensional) deterministic chaotic systems from stochastic 
processes has been extensively studied in the last decades (see for instance (2l| - [27j ). Essentially, all methods that have 
been introduced so far rely on two major points: Firstly, chaotic systems have a finite dimensional attractor, whereas 
stochastic processes arise from an infinitely dimensional one. Being able to reconstruct this latter attractor is thus 
a clear evidence showing that the time series has been generated by a deterministic system. Secondly, deterministic 
systems evidence, as opposed to random ones, short-time prediction: the difference between the time evolution 
of two nearby states will remain rather low for regular systems and increase exponentially fast for chaotic ones, 
while for stochastic processes this difference should be randomly distributed. Whereas several algorithms relying 
on the preceding concepts are nowadays available, the great majority of them are purely numerical and/or usually 
complicated to perform, computationally speaking (these difficulties are eventually more acute for noisy series [28| 
or high dimensional chaotic ones [29j). Furthermore, even the discrimination between a chaotic series and a series 
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FIG. 8: Mean path length L{N) of an horizontal visibility graph associated to random series of A'^ = 2^, 2* 
The solid line corresponds to the better fit L{N) — 1.31og(A'") — 1.7. 



.2^'' data (dots). 



— random (theoretical) 

□ logistic map p. = 4 

> Henon map (x variable) 




FIG. 9; Semi-log plot of the degree distribution of several horizontal visibility graphs associated to: (solid line) theoretical 
prediction for random series (equation llSp . (squares) time series of lO'' extracted from a logistic map xt+i = iJ,Xt{l — xt) in 
the fully chaotic region /i = 4, (black triangles) time series of lO'' extracted from x variable of the Henon map {xt+i,yt+i) = 
(j/t + 1 — axi, hxt) in the fully chaotic region (a — 1.4, h — 0.3). 



of i.i.d. random variables, something that an autocorrelation function or a power spectra fails to do but some other 
methods such as recurrence plots can 32| is nontrivial when the chaotic degree of the series is high, or even when 
such series is polluted with noise. All these complications provide motivation for a search for new methods that 
can directly distinguish, in a reliable way, random from chaotic time series, prior to quantifying the dimension [s^l 
and without needs for additional sophisticated techniques such as surrogate data i31 | or noise reduction methods 
[28j . In the preceding sections we have proved that the horizontal visibility graph associated to a random series 
has well-defined and universal degree distribution, local clustering distribution and P(n), independent of the shape 
of the random probability distribution /(x). These theorems guarantee that horizontal visibility graphs with other 
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topological properties are not uncorrelated random series. In what follows we explore the reliability of the method to 
distinguish uncorrelated randomness from chaos in finite series. 

A. Low-dimensional chaos 

In order to test the practical usefulness of this method, we have generated the horizontal visibility graph of several 
noise-free chaotic series, and have calculated numerically their degree distribution. We have restricted our analysis to 
discrete systems (maps), but the method is also extensible to flows (in that case the null hypothesis would no longer 
be white noise but Brownian motion [l3|)- In figure [9] we have plotted in semi- log the results of these simulations. 
Compare it with figure 3. In every case and by simple visual inspection we can conclude that P{k) deviates from 
equation 1151 the method is able to easily distinguish randomness from low-dimensional chaos (similar results are 
obtained with P{n) and P{C), but P{k) works better as discriminator). 

Observe at this point that if we shuffle the series data and reproduce the analysis, we would find a degree 
distribution that now would correspond to equation 1151 since shuffling breaks the temporal correlations of the series: 
such shuffled series would be equivalent to a random series extracted from a probability distribution equal to the 
system's probability measure (the beta distribution in the case of the Logistic map). We can deduce that the 
algorithm captures temporal correlations of time series, and that P{k) plays the role of an autocorrelation function, 
but with the additional ability of capturing nonlinear correlations. Observe also that this method neither works on 
the time nor on the frequency domain, since it only makes use of topological features. 



B. Noisy chaotic series 

It is well known that standard methods evidence problems when noise is present in chaotic signals, since even a small 
amount of noise can destroy the fractal structure of a chaotic attractor and mislead the calculation of chaos indicators 
such as the correlation dimension or the Lyapunov exponents j28| . In order to check the algorithm's robustness, we 
have introduced an amount of white noise (measurement noise) in a signal extracted from a fully chaotic Logistic map 
(/i = 4.0). In figure [in] we plot the degree distribution of its associated visibility graph. Remarkably, the algorithm 
still discriminates noisy chaotic behavior from randomness even when the noise level reaches the 100% of the signal 
amplitude. 
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FIG. 10: Semi-log plot of the degree distribution of an horizontal visibility graph associated to: (triangles) noisy chaotic series 
of 10^ data extracted from the Logistic map [fj, — 4) with a measurement noise level of 10% (by amplitude), (circles) idem but 
for noise level of 100%. The solid line corresponds to the theoretical prediction for random series P{k) — (1/3) (2/3)*^"^. 
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C. High dimensional chaos: Coupled Map Lattice 



Standard methods of phase space reconstruction also demonstrate computational problems when the attractor 
dimension is large [29[. Here we have generated high dimensional chaotic series by means of the so called Coupled 
Map Lattice (CML) |33l, a paradigmatic formalism for spatiotemporal chaos, widely used to model chaotic extended 
systems including fully-developed turbulence and pattern formation problems. We have coupled N = 1000 Logistic 
maps xl, i = 1, ...,N such that xl_^_i = f{e/2xl~^ + (1 - + e/2a:J+^), where f{x) = 4x(l - x) and e = 0.4 is the 
coupling strength (note that such a system exhibits high dimensional chaos with an estimated attractor dimension 
Z?2 — 800 29] ) . In figure [TT] we have plotted the degree distribution of its associated visibility graph along with the 
theoretical prediction for a random series. While the deviations from eq. [15] are not as evident as for low dimensional 
chaotic series, a clearly rejects the hypothesis of randomness: the method distinguishes randomness from high 
dimensional chaos. 




FIG. 11: Semi-log plot of the degree distribution of an horizontal visibility graph associated to: (squares) chaotic series of 10^ 
data extracted from a CML of 1000 Logistic maps with jj. — 4 and a coupling strength e = 0.4 (see the text). The solid line 
corresponds to the theoretical prediction for random series P{k) = (l/3)(2/3)*~^. Inner box: values of goodness of fit test 
between eq. 1151 and (i) a random series extracted from a uniform distribution (reference value), (ii) the CML series. While 
visually speaking the deviations from ea llSI are not here as evident as in the case of low dimensional chaotic series, a test 
clearly rejects the hypothesis of randomness (the critical values for 10%, 5% and 1% level of significance are 35.2, 38.1 and 
41.7 respectively, very far from the value of the test statistic 9406.1): the algorithm distinguishes high-dimensional chaos from 
randomness. 



D. Topological properties of chaotic series 

Observe in fig. [HI that the series extracted from the Logistic and Henon maps seem to have an associated visibility 
graph with a degree distribution which has an exponential tail, yet different to eq. 1151 This characteristic can be 
explained as follows: First, the tail of P{k) is related to the hubs degree. Hubs correspond to the data series that 
have largest visibility. These are, according to eq. [T71 extreme events in the series, whose degree is truncated by other 
extreme data (statistically speaking). Accordingly, the tail of P{k) essentially reduces to calculate the probability 
distribution of recurrence times in the series. Within random series, notice that this distribution is straightforwardly 
exponential (recurrence times in a Poisson process are exponentially distributed consistent with eqfTSl Within 

chaotic series, recurrence time statistics are related [l7[ to the concept Poincare recurrence time (PRT) j35i], which 
measures the time interval between two consecutive visits of a trajectory to a finite size region of the phase space. As 
a matter of fact, it has been shown that Poincare recurrence times are exponentially distributed in several hyperbolic 
chaotic systems, including the Logistic and Henon maps (see [3§| and references therein). We conjecture that the 
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y2 
A 


Decimal expansion of tt 


19.9 


Decimal expansion of e 


20.2 


Decimal expansion of In 2 


22.34 


Random series extracted from uniform distribution 


23.1 



TABLE I: 'x goodness-of-fit test between the degree distribution of visibility graphs associated to several number theoretical 
sequences and the theoretical prediction for random series. The number theoretical sequences are constructed from the decimal 
expansion of the first 6- 10^ digits of tt, e and In 2, grouped by tuples of 6 elements, what provides series of 10^ data respectively. 
As expected, the normality of tt, e and In 2 is not rejected by the test. 

functional form of P(fc) is closely related for chaotic series with their associated Poincare recurrence time distribution 
(vifhich deviate from the Poissonian statistics (eq. Hn]) due to deterministic effects), something that will be addressed 
in future work. 



E. Stochastic processes versus chaos 

The task of distinguishing determinism from a generic stochastic process (e.g. fractional Brownian motion, high 
order Markov models, etc) is more general and goes well beyond the scope of this work, since our theory only addresses 
series of i.i.d. random variables (uncorrelated random series). However, in a recent work [13] it has been shown that 
fractional Brownian motions and colored noise series map into scale free visibility graphs, which clearly differ from 
the functional form of P(fc) for chaotic series and from i.i.d. theory. In this sense we conjecture that the visibility 
algorithm efficiently discriminates not only uncorrelated randomness from chaos but also more complicated stochastic 
processes such as colored noise or fractional Brownian motion. 

VI. SOME CONJECTURED RANDOM LIKE SERIES: DECIMAL EXPANSION OF NORMAL 

NUMBERS 

A real number K (which can be understood as a series if we pick its decimal expansion) is defined as a normal 
number if for all integer fc, any given fc— tuple is equally likely in the fc— expansion of i?; that is to say, the digits of a 
real number show a uniform distribution in every base [isj . For instance, in a decimal base, if number R is normal 
then every string of size fc is equally likely to appear: for k = A the string 3254 is as likely as 1234, and this holds 
for all k. It is a well-known result from measure theory that a real number chosen at random is absolutely normal 
with probability 1. Interestingly, many fundamental constants such as tt, e or common irrational numbers such as 
In 2 or are conjectured to be normal, but not a single proof exists so far [3]. Now, the degree distribution of 
a visibility graph associated to the series generated by the decimal expansion of a normal number should follow 
equation 1151 In other words, a deviation from eq. [T2] would imply the non-normality of a given number. In table |T] 
we have reported the values of a goodness-of-fit test between the degree distribution of graphs associated to the 
decimal expansion of several conjectured normal numbers (series of = 100000 data) and equation [121 The same 
test has been performed for the case of a random series extracted from a uniform distribution of the same size, for 
the sake of comparison. As expected, the null hypothesis of normality cannot be rejected. Note that this procedure 
can easily extend to other number theoretic sequences which are also conjectured to be random. 



A. Note on flows 

Notice that the theory that we have developed in sections II to IV addresses a series of i.i.d. variables, that 
is, a discrete series. Accordingly, we have compared the results obatained from chaotic maps or from the decimal 
expansion of numbers to this i.i.d. theory, that is, discrete data. Now, it is not straightforward to compare this theory 
with visibility graphs extracted from flows (continuous series), since in any discretization of a flow some continuity 
properties are present, something that is not assumed a priori in the i.i.d. theory. This will be addressed in further 
work. 
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VII. CONCLUDING REMARKS 



In this work we have introduced the horizontal visibihty algorithm, an algorithm that maps time series into graphs 
which is inspired in the so called visibility algorithm [1]. The present algorithm is quite similar to the latter, yet 
analytically solvable. Accordingly, we have obtained exact results on several properties of the horizontal visibility 
graph associated to generic uncorrelated random series, and numerical simulations confirmed its reliability for finite 
series. Concretely, the degree distribution of the graph has an exponential form P{k) = (l/3)(2/3)''~^, the clustering 
coefficient C has a probability distribution P{C) = (1/3) (2/3)^/'-^"^ and the mean path length scales with the system's 
size in a logarithmic fashion, evidencing the Small- World phenomenon [13] . Since the results are independent of the 
distribution from which the series was generated, we conclude that every uncorrelated random series must have the 
same horizontal visibility graph, and in particular the same degree distribution. Thereby, this algorithm can be used 
as a simple test for discriminating uncorrelated randomness from chaos. Concretely, we have shown that the method 
can perfectly distinguish between random series (different probability distributions) that indeed follow the theoretical 
prediction and chaotic series (logistic map, tent map, Henon map) that clearly deviate from the theory. This extends 
to chaotic series polluted with noise and even to high dimensional chaotic series (coupled map lattice). 



Observe that this method diverges from the standard algorithms introduced so far |27| since it makes use of graph 
theoretical techniques to characterize nonlinear temporal correlations of the series, and its recipe is straightforward: 
(i) construct a visibility graph from the series under study, (ii) compute its degree distribution P(fc) and compare 
it to eq. [m A visual inspection (or eventually a goodness-of-fit test between P{k) and equation [15] if needed) 
allows us to reject the hypothesis that the series is random. The algorithm is direct and has low computational cost, 
as opposed to several standard methods. Furthermore, it is not just empirical since it is based in exact results. It 
is worth emphasizing that its purpose is not to quantify chaos but to easily discriminate chaos from uncorrelated 
randomness. For practical purposes, the method should be used as a reliable preliminary test when looking for 
deterministic fingerprints in time series (in this sense, once we have checked that P{k) has an exponential tail that 
deviates from equation[Tni embedding methods should be applied to the series). Whether this algorithm is also able to 
quantify chaos, as well as the relation between standard chaos indicators (Lyapunov exponents, correlation dimension, 
etc) and the topological properties of the visibility graphs are open problems for further research. 
It is also worth commenting that in a preceding work it has been shown that the visibility algorithm is also able 
to identify colored noise series (/~^ noises, fractional Brownian motion), since their associated visibility graphs are 
scale- free jl3| . and an algebraic relation between the exponent of the power law degree distribution and the Hurst 
exponent of the time series exists. In this sense, the visibility algorithm can also discriminate chaos from colored 
noise. 
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